ONLINE TRACKING AND FIXING OF INVALID GUESS-DBAs 
IN SECONDARY INDEXES AND MAPPING TABLES 
ON PRIMARY B+TREE STRUCTURES 



Field of the Invention 

The present invention relates to organizing and accessing data in a database using 
database indexes. In particular, the present invention relates to auxiliary structures such as 
mapping tables and secondary index structures for indexing tables stored as primary B+trees. 
More particularly, the present invention relates to methods and structures for efficient 
maintenance of mapping tables and secondary index structures for a primary B+tree structure as 
the primary B+tree structure is updated. 

Background of the Invention 

In a typical relational database system, users store, update, and retrieve information by 
interacting with user applications. The applications respond to a user's interaction by submitting 
commands to a database application, or server, responsible for maintaining the database. The 
database server responds to commands by performing the specified actions on the database. To 
be correctly processed, the commands must comply with the database language that the database 
server supports. One popular database language is commonly known as Structured Query 
Language (SQL). 



Various access methods may be utilized to retrieve data from a database. The access 
methods used to retrieve data may significantly affect the speed of the retrieval and the amount 
of resources consumed during the retrieval process. Many information retrieval applications 
make use of indices when performing content-based searches on the database data. Examples of 
database indices include R-trees, quadtrees, and B-trees. 

Database indices provide organization and reference to the data in a database to permit a 
user to find particular items of data in the database or determine relationships among the data in 
the database. Database indices can also permit relationships between the data in a database and 
data not included in the database to be determined. For example, an index can make it possible 
to determine location within a certain distance of a location defined in a database comprised of 
geographical location information. 

Summary of the Invention 

The present invention addresses problems associated with block addresses stored in 
secondary indexes and/or mapping tables for a primary B+tree structure becoming stale due to 
row movement in the primary B+tree structure caused by leaf block splits. No known solution 
exists for addressing the problems that the present invention resolves. 

The present invention provides a method for maintaining a system for database 
management. The method includes recording an address of a newly created block resulting from 
splitting of a leaf block of a primary B+tree and maintaining the address in a list as part of 
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primary B+tree metadata. 



Additionally, the present invention concerns a system for organizing a database index. 
The system includes a list as part of primary B+tree metadata. The list maintains addresses of 
newly created leaf blocks during split operation of the primary B+tree. 

Also, the present invention relates to a computer program product for performing a 
process for maintaining a database management system. The computer program product 
includes a computer readable medium and computer program instructions recorded on the 
computer readable medium and executable by a processor. The computer program instructions 
perform steps including recording a new address for a newly created block during splitting of a 
leaf block of a database and maintaining the new address in a list as part of primary B+tree 
metadata. 

Furthermore, the present invention provides a system for performing a process for 
maintaining a database management system. The system includes a processor that is operable to 
execute computer programunstructions. The system also includes a memory operable to store 
computer program instructions executable by the processor. The computer program instructions 
perform steps including recording a new address for a newly created block during splitting of a 
leaf block of a database and maintaining the new address in a list as part of primary B+tree 
metadata. 

Still other objects and advantages of the present invention will become readily apparent 
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by those skilled in the art from a review of the following detailed description. The detailed 
description below shows and describes preferred embodiments of the present invention, simply 
by way of illustration of the best mode contemplated of carrying out the present invention. As 
will be realized, the invention is capable of other and different embodiments and its several 
details are capable of modifications in various obvious respects, without departing from the 
invention. Accordingly, the drawings and description are illustrative in nature and not 
restrictive. 

Brief Description of the Drawings 

Objects and advantages of the present invention will be more clearly understood when 
considered in conjunction with the accompanying drawings, in which: 

Fig. 1 represents a block diagram of an embodiment of a database management system 
according to the present invention. 

Detailed Description of the Invention 

For the primary B+-tree structures, an additional mapping table can be created as 
described in a U.S. patent application filed on even date herewith for "Mapping Logical Row 
Identifiers For Primary B+Tree-Like Structures To Physical Row Identifiers", to Chong et al., 
and having attorney docket number 191 1 1.0038, to supporting bitmap indexes, which are 
described in a U.S. patent application filed on even date herewith for "Supporting Bitmap 
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Indexes on Primary B+tree Structures", to Chong et al., and having attorney docket number 
191 1 1.0040, the entire contents of the disclosures of both of which are hereby incorporated by 
reference. Also, bitmap indexes are described in greater detail in U.S. patent 5,363,098, for 
"Byte Aligned Data Compression", issued November 8, 1994, to Antoshenkov, the entire 
contents of the disclosure of which are hereby incorporated by reference. Additionally or 
alternatively, secondary indexes can be created with logical row identifiers as described in U.S. 
patent application serial number 09/473 ,073? to Chong et al., filed December 28, 1999, for 
"Database System Having Logical Row Identifiers", the entire contents of the disclosure of 
which is hereby incorporated by reference. Both mapping tables, which may be utilized for 
supporting bitmap indexes, and secondary indexes store physical data block address to speed up 
query performance. 

However, database block addresses (DBA), referred to as guess-DBAs, which may be 
stored in the mapping table, and secondary indexes can become stale due to movement of 
primary B+tree rows caused by leaf block splits. Prior to the present invention, the degree of 
staleness was not reflected in the guess-DBA quality statistics. Also, fixing invalid guess-DBAs 
in these structures involved doing a full sweep of the structure even if only a small portion of 
guess-DBAs had gone stale. 

The present invention addresses problems related to staleness of database block address 
stored in mapping tables and secondary index structures and/or any other index or structure that 
references a primary B+tree structure. Along these lines, according to the present invention, 
during a leaf block split the address or database block address of a new block into which some of 
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the rows move may be recorded. The DBA of that block may be maintained in a list as part of 
primary B+tree metadata. Also, a count of DBAs in the list may be maintained. 

In order to determine whether it is necessary to carry out the present invention, a measure 
5 of invalid guess-DBAs may be maintained. This ratio may be utilized to adjust the guess-DBA 
quality of mapping table and/or secondary indexes. According to one embodiment, the measure 
includes the ratio <count of DBAs>/<total # of leaf blocks> as a measure of invalid guess-DBAs 
in mapping table and/or secondary indexes. According to the present invention, the list of DBAs 
is maintained only when the ratio is less than a predetermined threshold, such as, for example, 
about 10%. 
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\n At the point the selected measure described above falls below the predetermined 

13 threshold, or any measure indicates that the quality of the mapping table, secondary index or 
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U other structure has fallen below a certain level, the present invention can include initiating 
15 operation, either explicitly by user or implicitly by the system, for revising the mapping table, 
||T secondary index and/or other structure. Typically, the revision includes a selective fix-up of 
corresponding mapping table and secondary index entries. 

The following describes one embodiment for carrying out the revisions in a mapping 
20 table. For all rows in the list of blocks, to fix mapping table entries, the values for the following 
elements may be retrieved from the primary B+tree: corresponding mapping table row 
identifiers and the DBA of the current block in the list. Next, these values may be sorted in the 
order of mapping table row identifiers. Then, mapping table rows corresponding to the mapping 
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table row identifiers may be retrieved and its guess-DBA component is updated, if it differs from 
the current DBA. 



In the context of a secondary index, entries in the index may be updated as described 
below. Fok all rows in the list of blocks, to fix secondary index entries, the values for the 
following elements may be retrieved from the primary B+tree: a secondary index key, a primary 
key, and a DBaW the current block in the list. Then, the values may be sorted in order of the 
(secondary index kW primary key) pairs. Next, an index row corresponding to the (secondary 
index key, primary k£V) pair may be retrieved and its guess-DBA component is updated if it 
differs from the current DBA. 

The updating of a mapping table and/or a secondary index structure may be carried out 
on-line. Alternatively, the updating may be carried out off-line. Regardless of when the updates 
are carried out, they may be carried out in a piece-wise manner, particularly when only the 
guess-DBA is being updated. Similarly, regardless of when the updating occurs, the updates 
may be committed in batches. 

The above discussion of updating relates to situations where the defined ratio falls below 
a threshold value. On the other hand if the measure is above a predetermined threshold, it may 
be desired to carry out other operations on guess-DBAs. 

The following describes an embodiment of a process for resorting of guess-DBAs to a per 
object based fixing. For each mapping table or secondary index a full scan of the object may be 
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performed. For each row of the mapping table or secondary index, the correct guess-DBA may 
be determined by traversing the primary B+tree structure only up to the penultimate level. Then, 
each row of the mapping table or secondary index may be updated with the correct guess-DBA. 
Subsequently, the correct guess-DBAs may be committed to the mapping table or secondary 
index in small batches. 

To permit the above-described methods to be carried out, the present invention also 
includes a system for organizing databases. The system includes a list of addresses of block 
newly created during split operation on the primary B+tree. Typically, the auxiliary structures 
that benefit from this invention are a mapping table and/or a secondary structure. However, the 
present invention could be utilized with other structures. The system according to the present 
invention can also include a count of database block addresses in the list. Additionally, the 
system can also include a ratio of database block addresses to total number of leaf blocks as a 
measure of invalid guess-DBAs. 

An exemplary block diagram of a database management system 100 according to the 
present invention is shown in Fig. 1 . A database management system typically includes a 
programmed general-purpose computer system, such as a personal computer, workstation, server 
system, and minicomputer or mainframe computer. The embodiment of the database 
management system 100 shown in Fig. 1 includes processor (CPU) 102, input/output circuitry 
104, network adapter 106, and memory 108. CPU 102 executes program instructions in order to 
carry out the functions of the present invention. Typically, CPU 102 is a microprocessor, such as 
an INTEL PENTIUM® processor, but may also be a minicomputer or mainframe computer 
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processor. 



Input/output circuitry 104 provides the capability to input data to, or output data from, 
computer system 100. For example, input/output circuitry may include input devices, such as 
keyboards, mice, touchpads, trackballs, scanners, etc., output devices, such as video adapters, 
monitors, printers, etc., and input/output devices, such as, modems, etc. Network adapter 106 
interfaces database management system 100 with network 1 10. Network 1 10 may be any 
standard local area network (LAN) or wide area network (WAN), such as Ethernet, Token Ring, 
the Internet, or a private or proprietary LAN/WAN. 

Memory 108 stores program instructions that are executed by, and data that are used and 
processed by, CPU 102 to perform the functions of the present invention. Memory 108 may 
include electronic memory devices, such as random-access memory (RAM), read-only memory 
(ROM), programmable read-only memory (PROM), electrically erasable programmable read- 
only memory (EEPROM), flash memory, etc., and electro-mechanical memory, such as magnetic 
disk drives, tape drives, optical disk drives, etc., which may use an integrated drive electronics 
(IDE) interface, or a variation or enhancement thereof, such as enhanced IDE (EIDE) or ultra 
direct memory access (UDMA), or a small computer system interface (SCSI) based interface, or 
a variation or enhancement thereof, such as fast-SCSI, wide-SCSI, fast and wide-SCSI, etc, or a 
fiber channel-arbitrated loop (FC-AL) interface. 

I^lemory 108 includes a plurality of blocks of data, such as new address block 112, list 
block 1 14, and ratio block 1 16, and a plurality of blocks of program instructions, such as 
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processing routines 1 18 and operating system 120. New address block 1 12 stores a plurality of 
new addresses for rows split from a leaf block that have been received by the database 
management system 100 as a database index is modified. List block 1 14 stores a list of the new 
addresses as metadata for the database index. Ratio block 1 16 stores the ratio of database 
addresses to totaWimber of leaf blocks that may be used to evaluate invalid guess-DBAs. 
Processing routines^ 18 are software routines that implement the processing performed by the 
present invention. Op\rating system 120 provides overall system functionality. 

It is important to note that while the present invention has been described in the context 
of a fully functioning data processing system, those of ordinary skill in the art will appreciate 
that the processes of the present invention are capable of being distributed in the form of a 
computer readable medium of instructions and a variety of forms and that the present invention 
applies equally regardless of the particular type of signal bearing media actually used to carry out 
the distribution. Examples of computer readable media include recordable-type media such as 
floppy disc, a hard disk drive, RAM, and CD-ROM's, as well as transmission-type media, such 
as digital and analog communications links. 

The foregoing description of the invention illustrates and describes the present invention. 
Additionally, the disclosure shows and describes only the preferred embodiments of the 
invention, but as aforementioned, it is to be understood that the invention is capable of use in 
various other combinations, modifications, and environments and is capable of changes or 
modifications within the scope of the inventive concept as expressed herein, commensurate with 
the above teachings, and/or the skill or knowledge of the relevant art. The embodiments 
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described hereinabove are further intended to explain best modes known of practicing the 
invention and to enable others skilled in the art to utilize the invention in such, or other, 
embodiments and with the various modifications required by the particular applications or uses 
of the invention. Accordingly, the description is not intended to limit the invention to the form 
disclosed herein. Also, it is intended that the appended claims be construed to include alternative 
embodiments. 
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